Project-Team:MULTISPEECH

Inria | Raweb 2016 | Presentation of the Project-Team MULTISPEECH | MULTISPEECH Web Site




Section: New Software and Platforms

KATS

Kaldi-based Automatic Transcription System

Keyword: Speech recognition

Functional Description

KATS is a multipass system for transcribing audio data, in particular radio or TV shows. The audio stream is first split into homogeneous segments, and each segment is decoded with the most suitable acoustic model by a large-vocabulary continuous speech recognition engine. In this new software, the recognition engine is based on the Kaldi toolkit and uses Deep Neural Network (DNN) based acoustic models. An extra processing pass then rescores the $n$-best hypotheses with a higher-order language model.
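The final rescoring pass can be illustrated with a minimal Python sketch. This is not the KATS or Kaldi implementation: the `Hypothesis` type, the `rescore_nbest` function, the `lm_weight` value, and the toy language model scores are all hypothetical and chosen only for illustration. The idea shown is that each hypothesis keeps its acoustic score, its first-pass language model score is replaced by the score from a stronger model, and the list is re-ranked on the combined score.

```python
from typing import Callable, List, NamedTuple


class Hypothesis(NamedTuple):
    """One entry of an n-best list (hypothetical structure for this sketch)."""
    words: List[str]
    acoustic_logprob: float  # log-likelihood assigned by the acoustic model
    lm_logprob: float        # log-probability assigned by the first-pass LM


def rescore_nbest(
    nbest: List[Hypothesis],
    new_lm_logprob: Callable[[List[str]], float],
    lm_weight: float = 10.0,
) -> List[Hypothesis]:
    """Replace the LM score of each hypothesis and sort best first.

    The combined score is acoustic_logprob + lm_weight * lm_logprob;
    lm_weight is an assumed scaling factor, not a value from KATS.
    """
    rescored = [h._replace(lm_logprob=new_lm_logprob(h.words)) for h in nbest]
    return sorted(
        rescored,
        key=lambda h: h.acoustic_logprob + lm_weight * h.lm_logprob,
        reverse=True,
    )


# Toy stand-in for a higher-order language model: a lookup table of
# made-up sentence log-probabilities, with a low default for unseen text.
TOY_SCORES = {
    ("recognize", "speech"): -3.0,
    ("wreck", "a", "nice", "beach"): -6.0,
}


def toy_lm(words: List[str]) -> float:
    return TOY_SCORES.get(tuple(words), -20.0)


# First-pass n-best list, ordered by its first-pass combined score.
nbest = [
    Hypothesis(["wreck", "a", "nice", "beach"], -98.0, -4.0),
    Hypothesis(["recognize", "speech"], -100.0, -4.5),
]

best = rescore_nbest(nbest, toy_lm)[0]
```

With these made-up numbers, the first-pass ranking prefers the first hypothesis, while the rescored ranking prefers the second, since the stronger language model penalizes the less plausible word sequence more heavily. In a real system the new score would come from an actual higher-order language model rather than a lookup table.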

Participants: Odile Mella, Dominique Fohr and Denis Jouvet
Contact: Dominique Fohr
URL: Available online on the A||go platform: https://allgo.inria.fr/app/loriasts_kaldi
